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Abstract — This paper presents a novel method for directly 
incorporating user-defined control input saturations into the 
calculation of a control Lyapunov function (CLF)-based walking 
controller for a biped robot. Previous work by the authors has 
demonstrated the effectiveness of CLF controllers for stabilizing 
periodic gaits for biped walkers |2|, and the current work 
expands on those results by providing a more effective means 
for handling control saturations. The new approach, based on a 
convex optimization routine running at a 1 kHz control update 
rate, is useful not only for handling torque saturations but also 
for incorporating a whole family of user-defined constraints into 
the online computation of a CLF controller. The paper concludes 
with an experimental implementation of the main results on the 
bipedal robot MABEL. 



I. Introduction 

Biped locomotion presents an interesting control challenge, 
especially since the dynamic models are typically hybrid 
and underactuated. The method of Hybrid Zero Dynamics 
(HZD) Uni, nil, Cni has provided a rigorous and intuitive 
method for implementing periodic walking gaits in such 
robotic systems, by driving the system to a lower-dimensional 
zero dynamics manifold on which the walking gait exists as 
an exponentially stable periodic orbit. Typical experimental 
implementation of the HZD method has relied on input- 
output linearization with PD control to drive the system to the 
zero dynamics manifold |9|, but recent work by the authors 
has demonstrated that control Lyapunov function (CLF)-based 
controllers can be used to effectively implement stable walking 
with smoother control torques, both in simulation and in 
experimental contexts ||T], ||2l. 

CLF's provide a means for not only guaranteeing exponen- 
tial stability of a system but also providing an explicit bound 
on the rate of convergence. In the case of hybrid systems (such 
as biped robots with impulsive foot-ground impact), a stronger 
convergence property is required, and therefore we turn to 
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rapidly exponentially stabilizing control Lyapunov functions 
(RES-CLF). This type of CLF, which will be reviewed in 
more detail in Section |II] incorporates an additional tuning 
parameter which allows the user to directly control the rate 
of exponential convergence. The work in f2^ established the 
key theoretical properties of CLF controllers in a hybrid 
context, and also presented a description of the sucessful 
experimental implementation of a CLF-based controller on the 
robotic testbed MABEL. In fact, analysis of the experimental 
data showed that the control torques generated by the CLF 
controller were much smoother and less noisy than the control 
torques generated by a comparable PD controller However, it 
was also noted that the user-defined control saturations were 
active throughout a large portion of the walking experiment, 
and that these saturations had a significant impact on the 
actual performance of the CLF controller as compared to the 
predicted performance based on theoretical bounds. In this 
context the hard saturation bounds were "blindly" applied to 
the calculated CLF control torques, with no consideration of 
the potential effect on the CLF controller performance. 

The main contribution of this paper is to present an al- 
ternative method of controller implementation that not only 
preserves the desirable performance characteristics promised 
by the CLF theory, but also respects the user-defined saturation 
bounds on the inputs. To the authors' knowledge, CLF con- 
trollers that respect saturations have not been addressed before, 
and this paper provides a constructive technique for doing so, 
while also presenting an experimental implementation on an 
embedded hard real-time system with a high control rate of 1 
kHz. 

The paper proceeds as follows. In Section HI] we state the 
dynamics of the relevant model and review the results on CLF- 
based control of biped robots from [1] and [2 |. Section |III] 
discusses the adverse effects of user-specified control input 
saturations on the CLF controller, providing the motivation for 
Section HV] which introduces a new method for using quadratic 
programming to appropriately handle torque saturation con- 
straints for the CLF controllers. Section |V]presents simulation 
and experimental results, and we conclude in Section IVTl with 
a summary and discussion of future work. 

II. Control Lyapunov Functions for Hybrid 
Systems Revisted 

In this section we introduce the model for a biped robot 
and review the recent innovations introduced in [[T] and Q 
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for using control Lyapunov functions to control such systems. 
The dynamics for a biped robot (such as MABEL, the robot 
described in Section [VJ can be derived by the standard method 
of Lagrange and take the form 



D{q)q + C{q,q)q + Giq) = Biq)u, 



(1) 



where q E ^ is the robot configuration variable and u is the 
motor control torques. The configuration q for the particular 
case of MABEL is described in |9| and depicted in Figure l5al 
Reformulating the dynamics ([T]i as 



we also define output functions of the form 

y{q) ~ Hoq - yd{d{q)), 



(2) 



(3) 



where 6{q) is a strictly monotonic function of the configuration 
variable q. The method of Hybrid Zero Dynamics (HZD) aims 
to drive these output functions (and their first derivatives) to 
zero, thereby imposing "virtual constraints" such that the sys- 
tem evolves on the lower-dimensional zero dynamics manifold, 
given by 

Z = {{q, q) e I y{q) - 0, Lfy{q, q) = 0}. (4) 

If y{q) has relative degree 2, then the second derivative takes 
the form 

y ^ Ljy{q,q) + LgLfy{q,q)u, (5) 

where the decoupling matrix LgL fy{q, q) is invertible due to 
the vector relative degree assumption. Then defining 

u*{q, q) -{LgLfy{q, q)y^L]y{q, q), (6) 

and applying a pre-control law of the form 

u{qA) = u*{q,q) + ^1 



or 



u{q,q) ^ u*{q,q) + {LgLfy{q,q)) ^fi 



(7) 



(8) 



renders Z invariant (provided /i vanishes on Z). (Note that 
u*{q,q) is a feed-forward term representing the torque re- 
quired to remain on Z.) 

Under these assumptions, the dynamics ^ can be decom- 
posed into zero dynamics states z E Z and transverse variables 
V — [y y]- (See lfT2l . 161 for details.) Under a pre-control 
law of the form d?) or dHJ, the closed-loop dynamics in terms 
of {rjjz) take the form 



p{r],z). 



(9) 

(10) 



For the work presented here, we will use the pre-control law 
dHJ so that /(t], z) — Ff] and g{ri, z) — G, where 



F ^ 



/ 




(11) 



The most common approach to controlling the trans- 
verse variables (i.e. driving 77 to zero) relies on input- 
output linearization with PD control, using dHJ with fi = 



[ —-^Kp —^Ko ] rj, where Kp and Kp, are diagonal 
matrices chosen such that 



A 





-Kp 



I 

-Kd 



(12) 



is Hurwitz. Recently, a new method based on control Lyapunov 
functions has been introduced in |[T], ||2l, which provides an 
alternative method for controlling the transverse variables. 
That method can be summarized as follows. 

A function V^{vi) is a rapidly exponentially stabilizing 
control Lyapunov function (RES-CLF) for the system (|9]l if 
there exist positive constants ci , C2 , C3 > such that for all 
< e < 1 and all states (77, z) it holds that 



ci\\vr<vM<%M\' 



inf 



C3, 



LfV,{r], z) + L-gVeirj, z)fi + —Veiv) 



< 0. 



(13) 
(14) 



One way to generate a RES-CLF V^{j]) is to first solve the 
Lyapunov equation A^P+PA = —Q for P (where A is given 
by ( fT2b and Q is any symmetric positive-definite matrix), and 
then define 





/ 
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for which we have 








T 

= V 




LgVe{r],z) = 


--2^ 


^PeG. 



(15) 



(16) 

Associated with a RES-CLF is the set of all /i for which 
(O is satisfied, 

K4r,,z) = {fieU: Lf-V,{r,,z) + L-,V4ri,z)li+-V4v) < 0}, 

e 

and one can show that for any Lipschitz continuous feedback 
control law i.ie{rj,z) £ Ke{rj,z), it holds that 



\\vm<-.[^e'^'UO)\\, 
e \ ci 



(17) 



i.e. the rate of exponential convergence to the zero dynamics 
manifold can be directly controlled with the constant e through 
There are various methods for finding a feedback control 
law ^^{ri,z) £ K^{r],z); in practical applications, it is often 
important to select the control law of minimum norm. If we 
let C3 = ^"""W) ajjjj define 



(18) 



then this pointwise min-norm control law |4 | can be explicitly 
formulated as 



wherein we can take ^ = ji^ m dHJ. 



if V'0,e('7,2) > 

if ■ipo,e{v,z) < 0, 



(19) 
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Fig. 1; Motor torques (from the MABEL experiment described 
in ||2l) for the stance and swing legs for 4 consecutive steps 
of walking with the minimum-norm CLF controller given in 
(T% . The thicker plots indicate the experimental (saturated) 
torques, while the thinner plots are the raw (unsaturated) 
torques computed by the CLF controller For the leg angle 
motor (top graph), the raw (desired) control torque is at times 
more than 400% of the (actually implemented) saturated value. 
Moreover, this occurs over a significant duration of the step. 



in. Adverse effects of torque saturation on the 
CLF controller 

The approach described in Section |II] was successfully 
implemented on the robotic testbed MABEL, producing a 
stable walking gait with motor torques which were smoother 
than a comparable approach based on the PD control method. 
(See 121 for a description of the experiment and a reference 
to the online video.) However, analysis of the experimental 
data reveals that the user-imposed saturations on the control 
torque inputs were active throughout much of the experiment 
(see Figure [1) and significantly affected the implementation of 
the CLF control method. Though necessary to prevent unsafe 
or damaging motions, these saturation constraints were not 
applied in a manner that appropriately preserved the qualities 
of the CLF controller, and therefore the nominal bounds given 
by ( fT4l l and ( fTTj l were frequently violated. 

Saturation bounds for control inputs are typically imposed 
by the user either as a measure of safety or out of physical 
necessity, as in the case when current draw must be limited 
due to a dying battery. When the calculated ideal control effort 
frequently exceeds the prescribed saturation bounds, the con- 
troller performance is degraded and theoretical performance 
measures may be violated, as in the experiment described 
above. More importantly, when a control input is saturated, the 
system runs in open-loop and is no longer able to respond to 
increasing errors in tracking, often leading to eventual failure. 

Designing controllers which respect such saturation bounds 
is important (especially in the experimental context), and 
therefore a variety of approaches have been developed, such as 
quasi-linear control |3 |, which offers one solution for a special 
class of systems. The main objective of the current work is 
to present a method for implementing CLF-based controllers 
for a general class of nonlinear systems in a manner which 
respects the user-specified input saturations. 



IV. Formulating the CLF Min-Norm controller as 
A Convex Optimization 

To design such a controller, we proceed by recognizing that 
the pointwise min-norm controller in (fT9) can be equivalently 
expressed as a convex optimization problem formulated as 

T 

mm /i 

^' (20) 

s.t. Vo,e('7,z) + V'l,e(»7:2;) < 0. 

The inequality constraint above enforces the bound on the 
time-derivative of the CLF given by (fl4] i. which can be equiv- 
alently expressed as Vg^rj) < — cs/e Vs{r]). The solution of 
this convex optimization problem is then exactly the controller 
specified in ( fT9l ). 

Once we have expressed the pointwise min-norm controller 
as a convex optimization problem, we can introduce satura- 
tion bounds on the control input in the form of additional 
constraints for the convex optimization problem. However, for 
these additional constraints to be satisfied, we first need to 
relax the bound on the time-derivative of the CLF. We do this 
by requiring Veiv) ^ ^c^/s Ve{r]) +di, where di is typically 
a small positive quantity. We then introduce input saturations 
in the form of (soft) bounds, such that the control input u in 
(O satisfies Umin^d2 <u< Umax + ds, with d2,d3 typically 
small positive quantities. The new optimization problem is 
formulated as 

min ^^fi + pi dl+ p2 {d^ + df) 

^,d\,d2.d^ 

S.t. Vo,e(??,2) + V'l,e(f?, 2^) A* < C^l, (21) 

{LgLfuiq, q)y^ fj. > {umm - u*) - d2, 
{LgLfuiq, q)y^ fj. < {umax - u*) + ds, 

where pi,p2 are large positive numbers that represent the 
penalty of relaxing the inequality constraints and u* is de- 
fined by (|6]l. Typically, we choose p2 > pi to ensure that 
violation of the saturation bound on control input is penalized 
far more severely than violation of the bound on the time- 
derivative of the CLF. Note that the constraints on the control 
inputs enforce Umm ~ ^2 < u* + v < Umax + ds^ where 
V = {LgL fy{q, q))^^ fi, i.e., the bound is on the total control 
input. 

The above formulation provides the control designer with 
parameters to trade off violation of the bound on the time- 
derivative of the CLF with that of the saturation bound on the 
control input. However, in most cases the bounds on the inputs 
appear as hard saturation bounds which cannot be relaxed. 
In this case, the optimization problem can be redefined to 
perfectly satisfy torque bounds as, 

min /.t^A* + Pi d\ 

fj,,di 

S.t. ipo^e{v,z) +ipi.e{v,z) fJ. < di, (22) 
{LgLfy{q, q)y^ > [umin - u*), 
{LgLfy{q, q)y^ < {umax - u*). 

Remark 1: Note that in both ( |2TI ) and ( l22l i we have de- 
picted u„im and u„iax as constants. However, since the convex 
optimization problem is to be solved at every instant in 
time, these values can be specified as functions of time or 
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system state, leading to dynamic torque saturation. As will be 
discussed in the next section, for periodic motions (such as 
bipedal walking), this provides the option to define a control 
bound that varies according to the system state location along 
the periodic orbit. 

V. Simulation and Experimental Results 

In this section we present both numerical simulation and 
experimental results to validate the performance of the control 
methods described in Section |IV] Since experimental testing 
on MABEL was the ultimate goal, the numerical studies were 
conducted first on a simple model of MABEL, followed by 
simulations on a complex model of MABEL developed in 
ll8l. which closely repUcates the experimental setup. This 
latter model includes a compliant ground model as well 
as a model that allows for stretch in the cables between 
the transmission pulleys. MABEL is a 5-link bipedal robot 
with point feet and series-compliant actuation for improved 
agility and energy efficiency. The experimental setup has been 
described previously in [9| and is illustrated in Figure |5] 
For the simulations and experiments described here, the four 
output functions in Q were defined by the absolute pitch angle 
of the torso, the leg angle (LA) for the swing leg, and the 
appropriately scaled leg-shape motor position (mLS) for the 
swing and stance legs. The four control inputs are the leg-angle 
motor torque (umLAati ""mLAaw) and leg-shape motor torque 
(WniLSst I ""mLSaw) f^i" ^he Stance and swing legs respectively. 




Fig. 2: Lyapunov function and its derivative for the numer- 
ical simulations described in Section IV-AI The figures depict 
the results for four different cases of input saturation bounds. 
As can be observed in the plot of V^, Case C leads to instability 
in the walking gait. 



A. Numerical simulation 

The numerical simulation results presented here employ the 
CLF controller with hard input saturation, as in (l22i . We 
consider four separate cases with different control saturation 
bounds, given by 



A 



B 



C 



D 



-8 
-12 
-8 
-12 

-5' 

-8 
-2 
-2 

■-4 
-8 
-2 
-2 



< 



< 



< 



"mLAat 
MraLS.t 
""mLA.w 
"niLSaw 
MinLAa, 

WmLA™ 
"mLS™ 

UmLA™ 
"mLS™ 



< 



12 



12 



< 



< 



"-4" 




"mLA,t 


-7 


< 


"mLSst 


-1 




MmLA™ 


_-l_ 




.MmLS™. 



< u*{9) + 



where u*{9) is the nominal value of ^ along the periodic 
orbit, regressed as a 5th-order Bezier polynomial of 9{q). Note 
that in case D, the bounds are specified dynamically as a 



function of the state of the robot, resulting in dynamic torque 
saturation. 

Simulations of a representative walking step with the con- 
troller (l22t were run for each of Cases A-D; the corresponding 
RES-CLF Ve and its time derivative are presented in Figure 
|2] and the resulting input torques and tracking errors are 
illustrated in Figure |3] The saturation effects are most visible 
in the stance leg angle plots; as expected, more restrictive 
saturation bounds result in increased tracking error However, 
we observe that the degradation in performance is gradual. 
Note that case C leads to instability in the walking gait, as is 
evidenced in the plots of the tracking error in Figure |3]as well 
as the Lyapunov function in Figure |2] 

To illustrate the effect of saturation on the walking limit 
cycle, we also carry out simulations on the complex model of 
MABEL. We use the controller given by (|22] | in closed-loop 
and analyze the phase portrait of the torso angle, subject to 
several different saturation values. Figure ^illustrates the torso 
phase portrait for 15 steps of walking, and we observe that 
stricter saturations result in (gradual) deterioration in tracking, 
as evidenced by deviations of the limit cycle from the nominal 
orbit. The saturation values used here differ from those used in 
the simulations described in the first part of this section, since 
the complex model significantly differs from the simple model 
and the required torques for walking are different. However, 
the saturations are notationally similar, with the saturations 
becoming more restrictive as we go from Case I to Case IV 
(as described in the caption of Figure |4|l. 
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Fig. 3: (a) Motor torques for the stance (top two figures) and swing legs (bottom two figures), and (b) Corresponding errors 
in tracking the output (|3]l, based on the numerical simulations described in Section IV-AI Each figure depicts the results for 
four different cases of input saturation bounds. As can be observed in the tracking error plots, Case C leads to instability in 
the walking gait. 
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Fig. 4: Phase portrait of the torso angle for four different 
cases of input saturation bounds: Case I corresponds to 
—8 < UmLA < 8, —12 < UmLS < 12, Casc II corresponds to 
—6 < UmLA < 4, —6.5 < itmLS < 4, Casc III corresponds to 
—4 < UmLA < 2, —6 < UmLA < 3, and Case IV corresponds 
to a dynamic torque bound that is a function of the state of 
the robot. Observe that stricter saturations result in (gradual) 
deterioration in tracking, as evidenced by deviations of the 
limit cycle from the nominal orbit. 



B. Experimental results 

Motivated by the favorable numerical simulation results, 
we proceed to test the controller experimentally on MABEL. 
Experimental implementation of the CLE controller at real- 
time speeds is a challenging task, since it requires computation 
of the system dynamics (|2]), the Lie derivatives of the output 
([3]i, and the CLE controller terms (IT¥I . as well as the solving 
of a convex optimization problem. In order to meet hard 



real-time constraints of 1 kHz, these computations must be 
completed in less than 1 ms. By employing the custom- 
code generation method CVXGEN |7| for solving constrained 
quadratic programs, we are able to solve the optimization 
problem in a few hundred microseconds and meet the 1 
kHz update requirement, making experimental implementation 
feasible. 

Two walking experiments were performed with MABEL. 
The first experiment employed the control method presented 
in (I2TI 1. implementing soft bounds on the control torques with 



Pi 



50, p2 — 75, and u 



mm : ^max 



chosen such that 



< 



WmLA < 8, — 12 < UmLS < 12- The controller executed 169 
steps of stable walking, before the experiment was terminated 
by the operator. Figure |6]illustrates the resulting torques. Note 
that the section of the plots highlighted in green illustrate the 
points at which the user-defined soft bounds on control inputs 
are relaxed, as the controller trades off strict adherence to the 
control bounds for better performance with regards to the CLE 
bound. Figure |2] illustrates the Lyapunov function and its time 
derivative for this experiment. 

In the second experiment, we implemented the CLE con- 
troller with hard control saturation, as in (l22t . with the same 
CLE-bound penalty as in the first experiment (i.e., pi — 50) as 
well as the same choices for Umin and Umax- This experiment 
resulted in 70 steps of walking for MABEL and is portrayed 
in the video in Q. (A photo sequence depicting one represen- 
tative step is also shown in Figure [8]) Figure |9] illustrates the 
resultant control torques; we observe that the user-specified 
control bounds are respected, as evidenced by the flattened 
control signals at the boundary areas. Note that the green 
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(a) Coordinates 



(b) MABEL experimental setup 



Fig. 5: Experimental setup for bipedal robot MABEL and 
associated coordinates. (From ||9]-) 
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Fig. 7: Lyapunov function and its time-derivative for 4 con- 
secutive steps of walking with the CLE controller with convex 
optimization and soft constraints on saturations. 
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Fig. 6: Motor torques for the stance and swing legs for 4 
consecutive steps of walking with the CLE controller with 
convex optimization and soft constraints on saturations. The 
convex optimization was tasked to enforce the magnitude of 
the LA and LS motor torques to be within 8 Nm and 12 Nm 
respectively. The green highlights on the plots indicate regions 
where the user-specified torque bound was exceeded by the 
convex optimization. 



squares on the plot depict the time instances at which control 
bounds are not met, which occur at moments in which the 
convex optimization algorithm is not able to converge within 
the specified time constraints. These occurrences are isolated 
and have no affect on the experimental system since a motor is 
not able to respond to them. Figure [TO]illustrates the Lyapunov 
function and its time derivative for this experiment. 

VI. Conclusion 

We have presented an alternative method for implementing 
the pointwise min-norm CLE controller described in ([19) in 
a manner that more appropriately handles input saturations, 
whether those user-defined constraints are strict or soft. Nu- 
merical simulation as well as experimental implementation 
has demonstrated that these control methods can be very 
useful in practice, even in systems which require a high real- 
time control update rate. This method has great potential for 
effectively dealing with saturations in a variety of contexts, 
such as power-limited systems which could progressively 




Fig. 8: A photo sequence depicting one representative step 
from the second experiment described in Section IV-BI 
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Fig. 9: Motor torques for the stance and swing legs for 4 
consecutive steps of walking with the CLE controller with 
convex optimization and hard constraints on saturations. The 
convex optimization was tasked to enforce the magnitude 
of the LA and LS motor torques to be within 8 Nm and 
12 Nm respectively. The green square markers on the plots 
indicate the time instances at which the user-specified torque 
bound was exceeded by the convex optimization. This occurs 
when the convex optimization fails to converge within the 
maximum number of allowed iterations, a limit required to 
ensure the hard real-time constraints are met for experimental 
implementation. 
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Fig. 10: Lyapunov function and its time-derivative for 4 
consecutive steps of walking with the CLF controller with 
convex optimization and hard constraints on saturations. 

lower user-defined torque saturations as the battery charge 
decreases, thereby prolonging the last bit of battery charge 
while allowing system performance to gracefully degrade. 
In addition to dynamic torque saturation, we also note that 
this approach provides a method for incorporating a whole 
family of user-defined constraints into the online calculation 
of controller effort for the types of systems described here . 
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